Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Lingo - ein open source System für die Automatische indexierung deutschsprachiger Dokumente

Identifieur interne : 001191 ( Main/Exploration ); précédent : 001190; suivant : 001192

Lingo - ein open source System für die Automatische indexierung deutschsprachiger Dokumente

Auteurs : Ein Beitrag Von Klaus Lepsky [Allemagne] ; John Vorhauer [Allemagne]

Source :

RBID : Pascal:06-0316033

Descripteurs français

English descriptors

Abstract

Lingo is an open source software system for automatic indexing of german language documents. The development was determined by the aspects of flexibility, easy configuration, and different applications. The contribution deals with the advantage of a linguistic based automatic indexing system which will improve information retrieval. The available linguistic functionality of lingo is presented and explained via examples. Stemming, recognition and separation of composite words, lexical and algorithmic recognition of phrases and correction of OCR defaults are indicated too. Lingo's open system architecture, possible fields of application, and their boundaries are described.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="GER" level="a">Lingo - ein open source System für die Automatische indexierung deutschsprachiger Dokumente</title>
<author>
<name sortKey="Von Klaus Lepsky, Ein Beitrag" sort="Von Klaus Lepsky, Ein Beitrag" uniqKey="Von Klaus Lepsky E" first="Ein Beitrag" last="Von Klaus Lepsky">Ein Beitrag Von Klaus Lepsky</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>Institut für Informations- wissenschaft Fachhochschule Köln Claudiusstrasse 1</s1>
<s2>50678 Köln</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName>
<region type="land" nuts="1">Rhénanie-du-Nord-Westphalie</region>
<region type="district" nuts="2">District de Cologne</region>
<settlement type="city">Cologne</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Vorhauer, John" sort="Vorhauer, John" uniqKey="Vorhauer J" first="John" last="Vorhauer">John Vorhauer</name>
<affiliation wicri:level="3">
<inist:fA14 i1="02">
<s2>Gustavstrasse 6 50937 Köln</s2>
<s3>DEU</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName>
<region type="land" nuts="1">Rhénanie-du-Nord-Westphalie</region>
<region type="district" nuts="2">District de Cologne</region>
<settlement type="city">Cologne</settlement>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">06-0316033</idno>
<date when="2006">2006</date>
<idno type="stanalyst">PASCAL 06-0316033 INIST</idno>
<idno type="RBID">Pascal:06-0316033</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000382</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000404</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000364</idno>
<idno type="wicri:doubleKey">0720-6763:2006:Von Klaus Lepsky E:lingo:ein:open</idno>
<idno type="wicri:Area/Main/Merge">001225</idno>
<idno type="wicri:Area/Main/Curation">001191</idno>
<idno type="wicri:Area/Main/Exploration">001191</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="GER" level="a">Lingo - ein open source System für die Automatische indexierung deutschsprachiger Dokumente</title>
<author>
<name sortKey="Von Klaus Lepsky, Ein Beitrag" sort="Von Klaus Lepsky, Ein Beitrag" uniqKey="Von Klaus Lepsky E" first="Ein Beitrag" last="Von Klaus Lepsky">Ein Beitrag Von Klaus Lepsky</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>Institut für Informations- wissenschaft Fachhochschule Köln Claudiusstrasse 1</s1>
<s2>50678 Köln</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName>
<region type="land" nuts="1">Rhénanie-du-Nord-Westphalie</region>
<region type="district" nuts="2">District de Cologne</region>
<settlement type="city">Cologne</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Vorhauer, John" sort="Vorhauer, John" uniqKey="Vorhauer J" first="John" last="Vorhauer">John Vorhauer</name>
<affiliation wicri:level="3">
<inist:fA14 i1="02">
<s2>Gustavstrasse 6 50937 Köln</s2>
<s3>DEU</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName>
<region type="land" nuts="1">Rhénanie-du-Nord-Westphalie</region>
<region type="district" nuts="2">District de Cologne</region>
<settlement type="city">Cologne</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">ABI - Technik</title>
<title level="j" type="abbreviated">ABI - Tech.</title>
<idno type="ISSN">0720-6763</idno>
<imprint>
<date when="2006">2006</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">ABI - Technik</title>
<title level="j" type="abbreviated">ABI - Tech.</title>
<idno type="ISSN">0720-6763</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Automatic indexing</term>
<term>Document processing</term>
<term>German</term>
<term>Information retrieval</term>
<term>Language processing</term>
<term>Linguistic analysis</term>
<term>Open source software</term>
<term>System architecture</term>
<term>System description</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Logiciel libre</term>
<term>Traitement document</term>
<term>Traitement langage</term>
<term>Allemand</term>
<term>Indexation automatique</term>
<term>Recherche information</term>
<term>Analyse linguistique</term>
<term>Architecture système</term>
<term>Description système</term>
<term>Domaine d'application</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Lingo is an open source software system for automatic indexing of german language documents. The development was determined by the aspects of flexibility, easy configuration, and different applications. The contribution deals with the advantage of a linguistic based automatic indexing system which will improve information retrieval. The available linguistic functionality of lingo is presented and explained via examples. Stemming, recognition and separation of composite words, lexical and algorithmic recognition of phrases and correction of OCR defaults are indicated too. Lingo's open system architecture, possible fields of application, and their boundaries are described.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Allemagne</li>
</country>
<region>
<li>District de Cologne</li>
<li>Rhénanie-du-Nord-Westphalie</li>
</region>
<settlement>
<li>Cologne</li>
</settlement>
</list>
<tree>
<country name="Allemagne">
<region name="Rhénanie-du-Nord-Westphalie">
<name sortKey="Von Klaus Lepsky, Ein Beitrag" sort="Von Klaus Lepsky, Ein Beitrag" uniqKey="Von Klaus Lepsky E" first="Ein Beitrag" last="Von Klaus Lepsky">Ein Beitrag Von Klaus Lepsky</name>
</region>
<name sortKey="Vorhauer, John" sort="Vorhauer, John" uniqKey="Vorhauer J" first="John" last="Vorhauer">John Vorhauer</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001191 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001191 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:06-0316033
   |texte=   Lingo - ein open source System für die Automatische indexierung deutschsprachiger Dokumente
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024